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DETAILED ACTION 
Claim Rejections - 35 USC § 102 

1. The following is a quotation of the appropriate paragraphs of 35 U S C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United States before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 35 1(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United 
States and was published under Article 21(2) of such treaty in the English language. 

2. Claims 1-3, 5-15, 17-20, and 22 are rejected under 35 U.S.C. 102(e) as being anticipated 
by Goldberg et al. (U.S. Pub. No. 2002/0080170 Al). 

As to claim 1, Goldberg et al. discloses a system for processing a multimedia data file to 
provide information supporting user navigation of multimedia data file content (See abstract, 
also see page figure 6), comprising: 

a content parser to identify text and image content of a data file (See page 1 1, paragraph 

0219); 

an image processor for processing said identified image content to identify embedded text 
content (See page 12, paragraphs 0236-0238); 

a text sorter for parsing said identified text and said identified embedded text to locate 
text items in accordance with predetermined sorting rules (See page 7, paragraphs 0141-0145), 
and 
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memory for storing a navigation file containing said text items (See page 11, paragraph 

0216). 

As to claim 2, Goldberg et al. discloses wherein the navigation file links to at least one 
internal document object (See page 1 1, paragraphs 0213-0214). 

As to claim 3, Goldberg et al. discloses wherein the navigation file links to at least one 
external document object (See page 11, paragraph 0213). 

As to claim 5, Goldberg et al. discloses wherein the content parser applies text extraction 
rules to identify text and identify a document structure, wherein the document structure defines a 
context for identified text (See page 11, paragraphs 0218-0220). 

As to claim 6, Goldberg et al. discloses wherein the content parser applies pre-defined 
hierarchical rules for determining a level of identified text (See page 12, paragraph 0236, also 
see page 4, paragraph 0085, and see page 9, paragraph 0183). 

As to claim 7, Goldberg et al. discloses wherein the image processor applies object 
templates to identify embedded text (See page 1 1 , paragraph 021 1). 

As to claim 8, Goldberg et al. discloses wherein the system refines a search resolution 
during a text identifying process to determine a location of the embedded text within an image 
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(See page 11, paragraphs 0213-0329, also see page 15, paragraph 0241). 

As to claim 9, Goldberg et al. discloses wherein identified text comprises hyperlinks (See 
page 7, paragraph 0142). 

As to claim 1 0, Goldberg et al. discloses a graphical User interface system supporting 
processing of a multimedia data file to provide information supporting user navigation of 
multimedia data file content (See page 7, paragraph 0148), comprising: 

a menu generator for generating, one or more menus permitting User selection of, an 
input file and format to be processed (See page 5, paragraphs 001 10-01 15); and 

an icon permitting User initiation of generation of a navigation file supporting linking of 
input file elements to external documents by parsing and sorting text and image content to 
identify text for incorporation in a navigation file (See page 5, paragraphs 0109-01 19). 

As to claim 1 1 , Goldberg et al. discloses wherein identified text comprises hyperlinks 
(See page 7, paragraph 0142). 



As to claim 12, Goldberg et al. discloses wherein the navigation file further comprises 
links to at least one internal document object. 
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As to claim 13, Goldberg et al. discloses a method of creating an anchorable information 
unit in a portable document format document (See page 12, paragraph 0228), comprising the 
steps of: 

extracting a text segment from the portable document format document (See page 12, 
paragraph 0228); 

determining a context of the segment, wherein the context is selected from a context 
sensitive hierarchical structure (See page 4, paragraphs 0085-0093, also see page 15, paragraphs 
0248- 0250); and 

defining the text segment as an anchorable information unit according to the context (See 
page 15, paragraphs 0250-0251). 

As to claim 1 4, Goldberg et al. discloses wherein the portable document format document 
(See page 12, paragraph 0228) includes one or more textual objects including and one or more 
non-textual objects, wherein the objects includes textual segments (See page 12, paragraphs 
0237-0238, also see abstract). 

As to claim 15, Goldberg et al. discloses wherein the step of determining the context 
further comprises the steps of: 

comparing the text segment to a plurality of known patterns within the portable document 
format document (See pages 6-7, paragraphs 0140-0148); and 
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determining the context upon determining a matching the text segment and a known 
pattern of the portable document format document (See pages 16-17, paragraphs 0281-0282). 

As to claim 17, Goldberg et al. discloses wherein the portable document format document 
includes a known context sensitive hierarchical structure (See page 12, paragraphs 0228-0238). 

As to claim 18, Goldberg et al. discloses wherein the context sensitive hierarchical 
structure, including the anchorable information unit is searchable (See page 7, paragraphs 0143- 
0149). 

As to claim 19, Goldberg et al. discloses wherein the context includes a location for the 
extracted text segment (See page 19, claims 44-45 language). 

As to claim 20, Goldberg et al. discloses wherein the step of determining a context 
further comprises the step of determining a location and a style of the text segment (See page 9, 
paragraphs 0171-0176). 

As to claim 22, Goldberg et al. discloses wherein the achorable information unit is 
automatically hyperlinked (See page 7, paragraph 0142). 



Claim Rejections - 35 USC § 103 
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3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

4. Claims 4, 23, 25-28 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Goldberg et al . (U.S. Pub. No. 2002/0080170 Al) in view of Jones et al. (U.S. Pub. No. 
2001/0047373 Al). 

As to claim 4, Goldberg et al. does not teach wherein the image processor comprises a 
black and white image processor comprising: 

a pixel smearing component reducing text to a rectangular block of pixels; and 
an image filtering component for cleaning a smeared image. 

Jones et al. teaches wherein the image processor comprises a black and white image 
processor comprising: 

a pixel smearing component reducing text to a rectangular block of pixels; and 
an image filtering component for cleaning a smeared image (See page 7, paragraphs 
0068-0069). 

Therefore, it would have been obvious to one of ordinary skill in the art at the time of the 
invention was made to have modified Goldberg et al. to include wherein the image processor 
comprises a black and white image processor comprising: 
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a pixel smearing component reducing text to a rectangular block of pixels; and an image 
filtering component for cleaning a smeared image. 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
was made to have modified Goldberg et al. by the teaching of Jones et al. to include wherein the 
image processor comprises a black and white image processor comprising: 

a pixel smearing component reducing text to a rectangular block of pixels; and an image 
filtering component for cleaning a smeared image because it provides for clearer and cleaner 
image and more accurate retrieved data. 

As to claim 23, Goldberg et al. discloses a program storage device readable by machine, 
tangibly embodying a program of instructions executable by the machine (See page 10, 
paragraph 0204) to perform method steps for creating an anchorable information unit file from a 
portable document format document (See page 15, paragraphs 0248-0251), the method steps 
comprising: 

parsing the portable document format document into textual portions and non-text 
portions (See page 11, paragraph 0219); 

extracting structure from the textual portions and the non-text portions (See page 12, 
paragraphs 0236-0238); 

determining text within textual portions, and text the non-text portions (See page 1 1 , 
paragraphs 0212-0214); and 

hyperlinking within the textual portions and non-text portions to a related document (See 
page 7, paragraphs 0142-0145). 
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Goldberg et al. does not teach a plurality of keywords. 

Jones et al. teaches a plurality of keywords (See page 9, paragraph 0090, also see page 
10, claim 10 language). 

Therefore, it would have been obvious to one of ordinary skill in the art at the time of the 
invention was made to have modified Goldberg et al. to include a plurality of keywords. 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
was made to have modified Goldberg et al. by the teaching of Jones et al. to include a plurality of 
keywords because it provides for faster and more accurate database storage and retrieval; this 
method is currently used by well-known search engines. 

As to claim 25, Goldberg et al. as modified discloses wherein the step of extracting 
further comprises the steps of: 

determining a level for extracted textual portions; 
associating the context with the text; and 

pattern matching extracted text to the portable document format document to determine a 
context and a location (See page 15, paragraphs 0241-0251, also see page 1 1 , paragraphs 0212- 
0214). 



As to claim 26, Goldberg et al. as modified discloses wherein the level is one of a 
paragraph, a heading and a subheading (See Jones et al. figure 9b). 
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As to claim 27, Goldberg et al. as modified discloses wherein the step of pattern 
matching (See page 7, paragraph 0148) further comprises the steps of: 

determining a median font size for the portable document format document; 

comparing a font size of the extracted text to the median font size for the portable 
document format document; and 

determining a context according to font size (See page 3, paragraphs 0053-0054, also see 
Jones et al. page 5, paragraphs 0052-0053). 

As to claim 28, Goldberg et al. as modified discloses wherein the step of hyper linking 
further comprises the step of creating the anchorable information unit file, wherein the plurality 
of keywords are anchorable information units (See Jones et al. page 6, paragraphs 0057-0060). 

5. Claims 16, and 21 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Goldberg et al . (U.S. Pub. No. 2002/0080170 Al) in view of Rothermel (U.S. Pub. No. 
2002/0035451 Al). 

As to claim 16, Goldberg et al. discloses wherein the step of extracting text further 
comprises the step of: 

extracting text form an underlying image of the portable document format document; 
determining a type for the image, a grayscale image, and a color image (See page 1 9, claim 43 
language, also see page 16, paragraphs 0273-0276); and 
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processing the image according to the type (See page 16, paragraphs 0273-0279). 

Goldberg et al. does not teach wherein the type is one of a black and white image. 

Rothermel teaches wherein the type is one of a black and white image (See page 2, 
paragraph 0017, also see page 4, paragraph 0035). 

Therefore, it would have been obvious to one of ordinary skill in the art at the time of the 
invention was made to have modified Goldberg et al. to include wherein the type is one of a 
black and white image. 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
was made to have modified Goldberg et al. by the teaching of Rothermel to include wherein the 
type is one of a black and white image because it provides for user customizable image retrieval 
and accurate representation for stored image. 

As to claim 21, Goldberg et al. does not teach further comprising the step of storing an 
extracted text segment in a Standard Generalized Markup Language syntax using a predefined 
grammar. 

Rothermel teaches further comprising the step of storing an extracted text segment in a 
Standard Generalized Markup Language syntax using a predefined grammar (See pages 2-3, 
paragraphs 00 1 9-0024) . 

Therefore, it would have been obvious to one of ordinary skill in the art at the time of the 
invention was made to have modified Goldberg et al. to include further comprising the step of 
storing an extracted text segment in a Standard Generalized Markup Language syntax using a 
predefined grammar. 
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It would have been obvious to one of ordinary skill in the art at the time of the invention 
was made to have modified Goldberg et al. by the teaching of Rothermel to include further 
comprising the step of storing an extracted text segment in a Standard Generalized Markup 
Language syntax using a predefined grammar because SGML provides less processing by 
maximizing limited memory and disk space by providing a complex set of "minimization" rules. 

6. Claim 24 is rejected under 35 U.S.C. 103(a) as being unpatentable over Goldberg et al . 
(U.S. Pub. No. 2002/0080170 Al) in view of Jones et al. (U.S. Pub. No. 2001/0047373 Al) as 
applied to claims 23, 25-28 and further in view of Rothermel (U.S. Pub. No. 2002/0035451 Al). 

As to claim 24, Goldberg et al. as modified discloses wherein the step of parsing further 
comprises the step of differentiating color image content (See page 3, paragraph 0024). 

Goldberg et al. as modified still does not teach from black-and-white content. 

Rothermel teaches from black-and-white content (See page 2, paragraph 0017, also see 
page 4, paragraph 0035). 

Therefore, it would have been obvious to one of ordinary skill in the art at the time of the 
invention was made to have further modified Goldberg et al. as modified to include from black- 
and-white content. 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
was made to have further modified Goldberg et al. as modified by the teaching of Rothermel to 
include from black-and-white content because it provides for user customizable image retrieval 
and accurate representation for stored image. 
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Conclusion 

7. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. 

Ainsbury et al. (U.S. Patent No. 6,078,924) teaches method for performing data 
collection in an information platform. 

Sweet et al. (U S Patent No. 6,567,799 B2) teaches retrieving documents transitively 
linked to an initial document. 

Baclawski (U.S. Patent No. 6,505,191 Bl) teaches distributed computer database for 
employing hyperlinked analysis. 

Marchisio (U.S. Patent No. 6,510,406 Bl) teaches inverse inference engine for high 
performance web search. 

8. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Neveen Abel-Jalil whose telephone number is 703-305-81 14. 
The examiner can normally be reached on 8:00AM-4: 30PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Dov Popovici can be reached on 703-305-3830. The fax phone number for the 
organization where this application or proceeding is assigned is (703) 872-9306. 

Any inquiry of a general nature or relating to the status of this application or proceeding 
should be directed to the receptionist whose telephone number is 703-305-3900. 
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